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I  The  M  genome  segment  of  Hantaan  virus  was  molecularly  cloned  and  the  nucleotide  sequence  of  cDNA  was  de- 
|  termined.  The  virion  RNA  is  3616  bases  long  with  3-  and  5'-terminal  nucleotide  sequences  complementary  for  18 
bases.  A  single  long  open  reading  frame  in  the  viral  complementary-sense  RNA  had  the  potential  to  encode  1135 

I  amino  acids  or  a  polypeptide  of  126,000  Da.  Amino-terminal  sequences  of  isolated  G1  and  G2  envelope  glycoproteins 
were  determined,  revealing  a  gene  order  with  respect  to  message  sense  RNA  of  5-G1-G2-3'.  Mature  G1  begins  18 
|  amino  acids  beyond  the  first  AUG  of  the  open  reading  frame,  preceded  by  a  short,  hydrophobic  leader  sequence.  G2 
■  begins  at  the  649th  amino  acid  of  the  open  reading  frame  and  also  follows  a  hydrophobic  sequence.  Carboxy  termini 

t  of  G1  and  G2  were  localized  and  gene  order  was  verified  by  immune  precipitation  of  Hantaan  proteins  with  antisera 

^  ■  to  synthetic  peptides  generated  by  using  amino  acid  sequences  derived  from  the  cDNA  sequence.  The  antipeptide 
sera  were  also  reactive  by  immunoblotting  with  SDS-denatured  G1  and  G2.  Molecular  weights  of  64,000  and  53,700 
were  calculated  for  the  G1  and  G2  glycoproteins,  respectively,  from  their  predicted  amino  acid  sequences.  Five  potential 
V  asparagine-linked  glycosyiation  sites  were  contained  within  the  G1  amino  acid  sequence  and  two  within  the  G2  sequence. 

C  These  data  are  consistent  with  our  previous  estimates  of  the  molecular  weights  and  extent  of  glycosyiation  of  the 
Hantaan  envelope  glycoproteins,  ©  1987  Academic  Press,  inc.  . 


INTRODUCTION 

Hantaan  virus  is  the  type  species  of  the  Hantavirus 
genus  of  the  Bunyaviridae  family.  Hantaviruses  have 
been  associated  with  many  clinically  similar  diseases 
known  collectively  as  hemorrhagic  fever  with  renal 
syndrome.  Hantaan  virus  is  the  etiologic  agent  of  Ko¬ 
rean  hemorrhagic  fever,  which  is  one  of  the  better 
known  and  most  severe  diseases  included  in  the  syn¬ 
drome.  Like  other  Bunyaviridae,  Hantaan  possesses  a 
three-segmented,  single-stranded  RNA  genome  of 
negative  polarity  (Schmaljohn  and  Dalrymple,  1983). 
The  large  (L),  medium  (M),  and  small  (S)  genome  seg¬ 
ments  have  relative  molecular  masses  (M,)  of  approx¬ 
imately  2.7,  1.2,  and  0.6  X  10®,  respectively,  as  esti¬ 
mated  by  agarose  gel  electrophoresis,  and  are  en¬ 
closed  in  three  separate  nucleocapsid  structures  which 
are  surrounded  by  a  lipid  envelope  containing  two  virus- 
specified  glycoproteins  designated  G1  and  G2 
(Schmaljohn  et  at.,  1983;  Schmaljohn  and  Dalrymple, 
1983,  1984;  Elliot  ef  a/.,  1984). 

Although  the  coding  strategies  of  viruses  in  different 
genera  of  the  Bunyaviridae  have  been  shown  to  vary 
considerably,  all  viruses  examined  to  date  encode  their 
nucleocapsid  protein  and  a  nonstructural  protein  in  the 
S  segment  and  their  envelope  glycoproteins  in  the  M 
segment.  By  deduction,  the  L  segment  is  believed  to 

1  To  whom  reprint  requests  should  be  addressed. 


encode  the  virion-associated  polymerase  (reviewed  in 
Bishop,  1985). 

Our  studies  on  the  basic  molecular  mechanisms  of 
Hantaan  replication,  including  the  coding  strategies  of 
the  genome  segments,  have  been  directed  toward  a 
better  understanding  of  the  viral  infection  process.  We 
have  found  that  Hantaan,  like  other  viruses  in  the  family, 
encodes  its  nucleocapsid  protein  in  the  S  RNA  segment 
(Schmaljohn  efa/.,  1986b).  Sequence  analysis  of  cloned 
cDNA  from  the  S  segment  revealed  only  a  single  long 
open  reading  frame  (ORF)  in  the  viral  complementary 
sense  RNA,  which  was  shown  to  include  the  gene  for 
the  nucleocapsid  protein  (Schmaljohn  ef  a/.,  1986c). 
The  only  other  potential  gene  product  was  a  6K  poly¬ 
peptide  (48  amino  acids),  encoded  in  the  same  reading 
frame  as  the  major  ORF,  initiating  at  an  AUG  three 
bases  beyond  the  ochre  stop  codon  of  the  nucleocap¬ 
sid  protein.  However,  the  actual  existence  of  any  non¬ 
structural  polypeptide  has  not  yet  been  demonstrated. 
These  data  suggested  that  the  coding  strategy  of  the 
S  segment  of  Hantaan  differs  considerably  from  the 
elaborate  ambisense  strategy  of  the  phlebovirus  S 
segment  and  the  overlapping  reading  frame  strategy 
employed  by  the  bunyavirus  S  segment  to  encode  both 
their  nucleocapsid  protein  and  a  nonstructural  (NSJ 
polypeptide  (Bishop,  1 985). 

Members  of  the  Bunyaviridae  for  which  sequence 
data  of  the  M  genome  segment  have  been  reported 
include  two  viruses  in  the  Phlebovirus  genus,  Rift  Valley 
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fever  and  Punta  Toro  viruses,  and  two  viruses  in  the 
Bunyavirus  genus,  snowshoe  hare  and  Bunyamwera 
viruses  (Collett  era/.,  1985;  lhara  eta!.,  1985;  Eshita 
and  Bishop,  1 984;  Lees  et  a!.,  1 986).  All  of  these  viruses 
were  found  to  encode  single  gene  products  in  the  viral 
complementary-sense  RNA.  Amino-terminal  sequence 
analysis  of  the  phlebovirus  glycoproteins  indicated  that 
both  Gl  and  G2  were  contained  within  that  gene  prod¬ 
uct,  and  sufficient  coding  information  preceded  the 
amino  terminus  of  the  first  encoded  glycoprotein  to 
produce  a  polypeptide  of  1 4K  for  Rift  Valley  fever  or 
30K  for  Punta  Toro  viruses.  This  putative  polypeptide, 
termed  NSM ,  has  not  been  identified  in  infected  cells, 
and  its  significance  remains  unknown. 

Although  the  gene  order  has  not  yet  been  determined 
for  members  of  the  bunyavirus  genus,  the  M  segments 
of  these  viruses  encode  a  NSM  polypeptide  in  addition 
to  Gl  and  G2.  While  NSM  proteins  have  been  observed 
in  bunyavirus-infected  cells  (Fuller  and  Bishop,  1 982), 
functions  for  these  polypeptides  have  similarly  not  been 
assigned. 

To  our  knowledge,  sequence  information  is  not  yet 
available  for  viruses  in  the  Uukuvirus  or  Nairovirus  gen¬ 
era.  However,  the  Gl  and  G2  proteins  of  Uukuniemi 
virus  have  been  shown  to  be  processed  from  a  1 1 0K 
polyprotein  precursor  in  cell-free  translation  studies 
(Ulmanen  etal.,  1981),  andGl  was  found  to  be  amino 
terminal  and  G2  carboxy  terminal  in  the  1 1 0K  precursor 
by  pulse-chase  labeling  of  infected  cells  (Kuismanen, 
1984). 

Therefore,  unlike  the  S  genome  segments  of  Bun- 
yaviridae,  which  demonstrate  widely  disparate  coding 
strategies  among  the  five  genera,  the  M  RNA  segments 
of  all  characterized  viruses  in  this  family  appear  to  en¬ 
code  polyprotein  precursors  which  are  processed  into 
envelope  glycoproteins.  We  report  here  that  the  M  ge¬ 
nome  segment  of  Hantaan  virus  conforms  to  this  gen¬ 
eral  observation,  localization  of  the  proteins  within  the 
derived  amino  acid  sequence,  however,  suggests  that 
Hantaan  may  diffeV  from  other  viruses  in  the  processing 
of  its  envelope  proteins. 

MATERIALS  AND  METHODS 

Molecular  cloning  and  sequence  analysis  of 
Hantaan  virus  M  RNA 

Hantaan  virus  was  propagated  in  Vero  E6  cells 
(C 1 008,  ATCC),  purified  by  sucrose  gradient  sedimen¬ 
tation,  and  RNA  was  extracted  from  virions  as  previ¬ 
ously  described  (Schmaljohn  et  at.,  1 983;  Schmaljohn 
and  Dalrymple,  1 983).  First-strand  cDNA  synthesis  was 
primed  by  the  addition  of  a  synthetic  oligonucleotide 


complementary  to  1 9  bases  at  the  3'  end  of  virion  M 
RNA  (5-TAGTAGTAGACTCCGCAAA-3')  (Schmaljohn 
and  Dalrymple,  1983).  Double-stranded  cDNA  was 
synthesized,  tailed,  and  inserted  into  the  Pst\  site  of 
pBR322,  and  recombinant  plasmids  were  identified  as 
previously  described  (Schmaljohn  et  at.,  1 986c).  Se¬ 
quence  analysis  was  performed  by  the  dideoxy  chain 
termination  method  of  Sanger  etal.,  (1977),  using  syn¬ 
thetic  oligonucleotides  to  prime  synthesis  on  cDNA 
subcloned  into  the  Ml 3  bacteriophage  (Messing, 

1 983).  Sequence  data  were  analyzed  with  the  Intelli- 
genetics  sequence  analysis  program  on  a  DEC  VAX 
11/750  computer  or  with  the  University  of  Minnesota 
sequence  analysis  program  and  an  Apple  lie  computer. 

Amino-terminal  sequence  analysis  of  Gl  and  G2 

Hantaan  Gl  and  G2  glycoproteins  were  prepared  for 
amino-terminal  sequence  analysis  by  electrophoresis 
through  polyacrylamide  gels  by  the  method  of  Dietz- 
schold  et  at.  (1986).  Proteins  were  boiled  in  65  m M 
Tris-borate,  pH  8.4,  containing  19b  SDS,  59b  2-mer- 
captoethanol,  and  109b  glycerol  and  applied  to  a  3- 
mm-thick,  discontinuous,  polyacry  n  .de  gel.  A  60-ml 
separating  gel  of  109b  acrylamide  find  0.059b  bisacry- 
iamide  was  prepared  in  0.375  M  Iris-sulfate,  pH  8.4. 
A  20-ml  stacking  gel  of  79b  acrylamide  and  0.359b  bis- 
acrylamide  was  prepared  in  0.17  M  Tris-sulfate,  pH 
8.4.  Gels  were  allowed  to  polymerize  for  at  least  24  hr 
and  were  electrophoresed  at  1 00  V  for  approximately 
12  hr,  after  which  they  were  sliced  horizontally.  Proteins 
were  recovered  from  gel  slices  by  agitation  in  50  m M 
ammonium  bicarbonate  containing  0.1%  SDS  for  two 
separate  12-hr  periods.  Samples  were  lyophilized  and 
dialyzed  extensively  against  water.  Approximately  1 00 
pmol  of  purified  proteins  was  sequenced  on  Model 
470 A  protein  sequencer  (Applied  Biosystems  Inc.,  San 
Francisco,  CA)  and  analyzed  with  a  1 20A  on-line  PTH 
analyzer. 

Peptide  synthesis,  immune  precipitation,  and 
immunoblot  analysis 

Peptides  representing  derived  amino  acid  sequences 
were  synthesized  with  an  Applied  Biosystems,  Inc. 
Model  430A  peptide  synthesizer.  Peptides  were 
cleaved  from  the  resin  with  hydrofluoric  acid  (Tam  and 
Merrifield,  1 985;  Stewart  and  Young,  1 984)  and  cou¬ 
pled  to  keyhole  limpet  hemocyanin  (KLH)  (Liu  et  at., 
1979). 

Approximately  1-1.5  mg  of  each  coupled  peptide 
was  injected  intradermally  with  Freund’s  complete  ad¬ 
juvant  into  four  shaved  sites  on  each  of  two  young 
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female  New  Zealand  white  rabbits.  Rabbits  were 
boosted  with  approximately  1 .5  mg  of  coupled  peptides 
in  Freund's  incomplete  adjuvant  at  2-week  intervals  and 
bled  from  10  to  14  days  postboost.  Rabbit  sera  were 
preadsorbed  with  uninfected  Vero  E6  cells  prior  to  use 
in  Hantaan  immune-precipitation  experiments.  Poly¬ 
clonal  rabbit  antisera  consisted  of  convalescent  sera 
collected  4-12  weeks  following  infection  of  New  Zea¬ 
land  White  rabbits  with  Hantaan  virus  (strain  76-118). 
Intracellular  viral  proteins  were  radiolabeled  with 
P5S]methionine  (100  pC i/25-cmz  flask  of  confluent  Vero 
E6  cells)  from  24  to  48  hr  postinfection.  Cells  were 
lysed  on  ice  in  buffer  containing  10  m M  Tris-HCI,  pH 
8.0;  1  m M  EDTA;  0.5 M  NaCI;  0.25  mg/ml  each  of 
aprotinin  and  a2-macroglobulin  (Sigma,  St.  Louis,  MO); 
and  49b  Zwittergent  3-14,  (Calbiochem-Behring,  San 
Diego,  CA).  Cell  nuclei  were  removed  by  centrifugation 
at  1 3,000  g  for  5  min  at  4°.  Virus-specific  proteins  were 
immune  precipitated  from  infected  cell  lysates  by  in¬ 
cubation  with  antisera  on  ice  overnight  followed  by  the 
addition  of  1 00  pi  of  509b  protein  A-Sepharose  (Sigma) 
and  continued  incubation  for  30  min.  Precipitates  were 
washed  three  times  in  0.5X  lysis  buffer  and  once  in  10 
mM  Tris-HCI,  pH  6.7.  Electrophoresis  was  performed 
at  100  V  for  1 2  hr  in  SDS-containing  polyacrylamide 
gels  as  previously  described  (Schmaljohn  eta/.,  1 983). 

For  immunoblot  studies,  proteins  were  electropho- 
retically  transferred  from  polyacrylamide  gels  to  nitro¬ 
cellulose  membranes  (Hoefer  Scientific  Instruments, 
San  Francisco,  CA).  Nonspecific  sites  were  blocked  by 
incubation  of  membranes  in  a  1 09b  nonfat  dry  milk  so¬ 
lution  overnight  at  room  temperature  (Johnson  et  a/., 
1 984)  and  were  probed  with  antisera  (1 :50  dilution)  for 
45  min  at  37°  in  a  Decaprobe  hybridization  chamber 
(Hoefer  Scientific  Instruments,  San  Fancisco,  CA).  Fol¬ 
lowing  a  10-min  wash  in  phosphate-buffered  saline 
(PBS,  Sigma),  two  washes  for  20  min  each  in  PBS  with 
0059b  NP-40  (Bethesda  Research  Laboratories, 
Gaithersburg,  MD),  and  one  10-min  wash  in  PBS,  0.5 
/iCi/lane  of  35S-protein  A  (Amersham,  Arlington  Heights, 
IL)  was  added,  and  incubation  was  continued  for  an 
additional  30  min  at  room  temperature.  Washing  was 
repeated  and  the  membrane  was  dried  and  exposed 
to  X-ray  film  (Kodak  XAR,  Rochester,  NY)  at  -70°. 

RESULTS 

Sequence  analysis  of  cDNA  and  features  of 
Hantaan  virus  M  RNA 

Hantaan-specific  cDNA  inserts  were  excised  from 
pBR322  plasmids,  sized,  and  restriction  mapped.  The 
largest  cDNA  clone  identified  appeared  of  sufficient 
length  to  contain  most  or  all  of  the  Hantaan  M  RNA 
segment.  The  nucleotide  sequence  of  this  cDNA  clone 


was  determined  by  dideoxy  chain  termination  synthesis 
on  cDNA  templates  subcloned  into  bacteriophage 
Ml 3.  Sequences  identified,  adjacent  to  the  added 
homopolymeric  tail,  corresponded  to  all  but  the  six 
most  3'  proximal  nucleotides  reported  for  Hantaan 
virion  RNA  (3'-AUCAUCAUCUGAGGCGUUUUCUUUG) 
(Schmaljohn  eta/.,  1985).  Comparison  of  the  termini  of 
the  cDNA  clone  revealed  complementary  sequences 
for  1 8  bases  which  included  all  but  the  three  most  distal 
nucleotides  complementary  to  the  virion  RNA  se¬ 
quence.  Since  3'  and  5'  complementarity  is  a  general 
feature  of  the  RNA  segments  of  Bunyaviridae  (Parker 
and  Hewlett,  1981;  Bishop  era/.,  1982;  Cabridilla  et 
a/.,  1983;  lhara  eta/.,  1984;  Collett  et  a/.,  1985),  we 
concluded  that  the  large  cDNA  clone  represented  the 
entire  M  RNA  segment  except  for  six  bases  corre¬ 
sponding  to  the  3'  terminus  and  three  bases  corre¬ 
sponding  to  the  5'  terminus  of  virion  RNA. 

A  base-paired  stem  structure  with  a  calculated  free 
energy  of  -28.2  kcal/mol  was  postulated  based  on  this 
terminal  nucleotide  sequence  complementarity  (Fig.  1). 
A  base  composition  for  the  entire  virion  M  RNA  was 
determined  from  cDNA  sequence  information  to  be 
29.99b  A,  17.9%  G,  21.4%  C,  and  30.8%  U. 

Coding  capacity  and  gene  order  of  the  M  RNA 

A  single  long  open  reading  frame  was  detected  in 
the  nuclectide  sequence  corresponding  to  viral  com- 
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Fig.  1.  Predicted  secondary  structure  of  the  3'-  and  5'-terminal 
nucleotide  sequences  of  Hantaan  M  RNA. 
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Fig.  3.  Potential  polypeptide  coding  regions  of  the  six  possible  reading  frames  of  viral  (vRNA)  or  viral  complementary-sense  (cRNA)  RNA.  The 
solid  line  represents  Hantaan  M  RNA  in  base  pairs  X  1 03. 


plementary-sense  RNA  (Fig.  2).  Examination  of  all  six 
possible  reading  frames  of  the  cDNA  failed  to  reveal 
additional  ORFs  initiating  at  ATG  codons  and  encoding 
1 00  amino  acids  or  more  (Fig.  3).  Within  the  major  ORF, 
protein  synthesis  could  potentially  begin  at  either  of 
two  in-frame  initiation  codons  at  nucleotide  positions 
41-43  or  65-67.  According  to  Kozak  (1 978, 1 984),  the 
first  codon  has  more  favorable  flanking  sequences  for 
initiation  of  protein  synthesis.  A  total  of  1135  amino 
acids  are  encoded  from  the  first  ATG  until  a  termination 
codon  (TAG)  occurs  at  nucleotide  position  3346-3348. 
Similar  to  other  Bunyaviridae,  the  putative  gene  product 
of  the  Hantaan  M  RNA  was  found  to  be  relatively  cys¬ 
teine  rich  (Table  1)  and  of  sufficient  length  to  contain 
both  the  nonglycosylated  G1  and  G2  envelope  glyco¬ 
proteins,  which  have  estimated  Mr  values  of  62K  and 
52K,  respectively  (Table  1)  (Collett  et  at.,  1985;  Eshita 
et  al„  1985;  Lees  et  at.,  1986;  Schmaljohn  et  a!., 
1986a). 

In  order  to  determine  the  gene  order  of  the  Hantaan 
M  RNA  segment  and  to  locate  the  amino  termini  of  the 
G1  and  G2  glycoproteins,  virion  structural  proteins  were 
isola’ed  and  subjected  to  amino-terminal  sequence 
analysis.  A  partial  amino  acid  sequence  of  (NH2)-Leu-X- 
X-V  al  -T  yr- Asp-  Met-Lys-I  le  -Glu  -X-P  ro  -  H  is-Thr- 
Thr-Val  was  determined  for  G1  and  (NH2)-X-Glu- 
X-Pro-Leu-X-Pro-Val-Trp-Asn-Asp-Asn-Ala-His- 
Gly— Val— Gly  for  G2,  with  X  representing  unidentified 
amino  acids.  These  sequences  correspond  to  those 
derived  from  cDNA  beginning  at  nucleotide  positions 
95-97  (Gl)and  1985-1987  (G2).  Therefore,  the  gene 
order  of  the  Hantaan  M  RNA,  with  respect  to  viral  com¬ 
plementary-sense  RNA,  is  5'-G1-G2-3'. 

A  hydrophobicity/hydrophilicity  plot  of  the  potential 
gene  product  revealed  that  the  amino  terminus  of  G1 
follows  a  hydrophobic  stretch  of  1 8  amino  acids  from 
the  first  ATG  of  the  ORF  (Fig.  4).  Similarly,  the  amino 
terminus  of  G2,  which  originates  at  amino  acid  649  of 
the  ORF,  follows  a  hydrophobic  sequence.  To  estimate 


the  position  of  the  carboxy  termini  of  Gl  and  G2,  pep¬ 
tides  corresponding  to  amino  acids  588-614  (NH2-Tyr- 
Lys-Val-Cys-GIn-Val-Thr-His-Arg-Phe-Arg-Asp- 
Asp-Leu-Lys-Lys-Thr-Val-Thr-Pro-GIn-Asn-Phe- 
Thr-Pro-Gly-Cys-COOH)  and  1127-1135  (NH2-Cys- 
Pro-Val-Arg-Lys-His-Lys-Lys-Ser-COOH)  of  the 
ORF  were  synthesized  and  antisera  to  each  were  gen¬ 
erated  in  rabbits.  The  ability  of  the  antipeptide  sera  to 
recognize  Gl  and  G2  was  measured  by  immune  pre¬ 
cipitation  of  native  proteins  and  by  hybridization  to  SDS- 
denatured  proteins  bound  to  nitrocellulose  membranes. 
Immune  precipitation  of  radiolabeled  Hantaan  virus 


TABLE  1 

Amino  Acid  Composition  of  Hantaan  M  RNA  Gene  Products* 


Name 

Number 

Percentage 

A— Alanine 

52 

4.6 

C — Cysteine 

61 

5.4 

D — Aspartic  acid 

50 

4.4 

E — Glutamic  acid 

58 

5.1 

F — Phenylalanine 

58 

5.1 

G — Glycine 

80 

7.0 

H — Histidine 

37 

3.3 

1 — Isoleucine 

76 

6.7 

K — Lysine 

67 

5.9 

L — Leucine 

86 

7.6 

M — Methionine 

23 

2.0 

N — Asparagine 

40 

3.5 

P — Proline 

51 

4.5 

Q — Glutamine 

34 

3.0 

R — Arginine 

34 

3.0 

S — Senne 

98 

8.6 

T — Threonine 

81 

7.1 

V — Valine 

88 

7.8 

W — Tryptophan 

19 

1.7 

Y — Tyrosine 

42 

3.7 

•  Deduced  amino  acids  of  the  major  open  reading  frame  extending 
from  nucleotides  41  to  3348.  The  predicted  molecular  weight  of  this 
polypeptide  is  126,300. 
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Fig.  4.  A  hydrophobicity/hydrophilicity  plot  of  the  gene  product  predicted  from  the  cDNA  sequence  of  viral  complementary-sense  Hantaan 
M  RNA.  Data  points  represent  a  running  average  taken  over  seven  amino  acid  residues.  Sequences  with  net  hydrophiiicity  appear  above  the 
line  and  are  shaded;  sequences  with  net  hydrophobicity  appear  below  the  line.  Values  assigned  to  amino  acids  are  Ala.  -0.5;  His,  -0.5;  Arg, 
3.0.  Asp,  3.0;  Cys,  -1.0;  Met,  -1.3;  Glu,  3.0;  Lys,  3.0;  Val.  -1.5;  lie,  -1.8:  Ser,  0.3;  Asn,  0.2;  Leu,  -1.8;  Tyr,  -2.3;  Gin,  0.2;  Gly,  0.0;  Phe, 
-2.5;  Trp,  -3.4;  Pro.  0.0;  Thr,  -0.4.  The  coding  regions  of  G1  and  G2  are  overscored  with  a  dashed  line  and  the  regions  corresponding  to 
synthetic  peptides  are  marked  with  a  *. 


polypeptides  from  infected  cell  lysates  demonstrated 
that  antisera  to  peptide  588-614  specifically  precipi¬ 
tated  G1 ,  and  antisera  to  peptide  1127-1135  precipi¬ 
tated  G2  (Fig.  5A).  Specificities  of  the  antipeptide  sera 
were  confirmed  by  probing  blots  of  virion  proteins  elec- 
trophoretically  transferred  from  polyacrylamide  gels  to 
nitrocellulose  membranes.  Rabbit  antipeptide  1 1 27- 
1135  sera  reacted  with  G2  and  antipeptide  588-614 
sera  reacted  with  G1  (Figs.  5B  and  5C).  Anti-Hantaan, 
polyclonal  rabbit  sera,  and  hyperimmune-mouse  ascitic 
fluid  were  reactive  with  nucleocapsid  protein,  and  to  a 
lesser  extent,  G2.  but  not  with  G1.  Preimmune  rabbit 
sera,  normal  mouse  ascitic  fluid,  and  a  mixture  of 
monoclonal  antibody-containing  mouse  ascitic  fluids, 
directed  against  G1  or  G2.  did  not  react  with  any  poly¬ 
peptides  on  the  immunoblots  (Fig.  5C).  Since  the  G2 
peptide  represents  the  exact  3'  nucleotide  sequences 
of  the  ORF,  these  data  indicate  the  G2  extends  from 
amino  acid  649  to  1127-1 135  and  has  a  calculated 
nol  wt  of  53,743.  Although  it  was  not  possible  to  pre¬ 
cisely  define  the  carboxy  terminus  of  G1 ,  the  immune- 
precipitation  data  indicate  that  G1  extends  at  least  from 
amino  acid  19  to  588-614  and,  consequently,  has  an 
approximate  mol  wt  of  64,000.  Both  of  these  values 
are  consistent  with  estimated  molecular  weights  for 
nonglycosylated  Hantaan  G1  and  G2  previously  re¬ 
ported  (Schmaljohn  ef  a/.,  1 986a). 

Five  potential  asparagine-linked  glycosylation  sites 
(Asn-X-Ser/Thr)  were  identified  in  G1  and  two  in  G2 
(Figs.  2  and  3).  These  data  are  consistent  with  the  more 
highly  glycosylated  nature  of  Gl;  however,  we  have 
not  yet  determined  how  many  of  these  sites  are  actually 
used  (Schmaljohn  ef  a/.,  1 986a). 

DISCUSSION 

Nucleotide  sequence  analysis  of  the  M  genome  seg¬ 
ment  of  Hantaan  virus  revealed  that  Hantaan  shares 


the  common  Bunyaviridae  property  of  encoding  a  single 
gene  product  in  the  viral  complementary-sense  RNA. 
Although  we  have  not  demonstrated  the  occurrence 
of  a  polyprotein  precursor  to  the  glycoproteins  in  in¬ 
fected  cells,  we  assume  that  both  envelope  glycopro¬ 
teins  are  processed  from  this  gene  product,  either  by 
cotranslational  or  post-translational  cleavage.  The  gene 
order  of  Hantaan  M  RNA  with  respect  to  messenger- 
sense  RNA  is  5'-Gl-G2-3'.  Hantaan  apparently  shares 
the  common  (Bunyaviridae)  property  of  a  leader  se¬ 
quence  before  the  first  encoded  glycoprotein,  as  has 
been  observed  for  the  phlebo-  and  bunyaviruses  (Collett 
etal.,  1985;  lhara eta/.,  1985;  Eshitaeta/.,  1984;  Lees 
ef  a/.,  1986).  The  18  amino  acids  following  the  first 
AUG  of  the  Hantaan  M  segment  ORF  and  preceding 
the  amino  terminus  of  mature  Gl  constitute  a  typical 
glycoprotein  signal  sequence  (von  Heijne,  1983).  The 
absence  of  a  "NSM”  coding  region  between  the  signal 
sequence  and  Gl  distinguishes  Hantaan  from  the 
phleboviruses  Punta  Toro  and  Rift  Valley  fever  viruses, 
which  have  long  (30K  and  1 6K,  respectively)  stretches 
of  polypeptides  prior  to  the  amino-terminal  sequences 
of  their  first  mature  glycoprotein  (Collett  ef  a/.,  1985; 
lhara  ef  a/.,  1985). 

Sequence  information  for  a  polypeptide  of  approxi¬ 
mately  70.3  k  Da  extends  from  the  region  between  the 
amino  terminus  of  Gl  to  the  amino  terminus  of  G2  and 
is  considerably  more  than  the  6 1 K  previously  reported 
for  nonglycosylated  Gl  (Schmaljohn  ef  a/.,  1986a).  This 
discrepancy  suggests  that  a  small  intergenic  region 
between  the  carboxy  terminus  of  Gl  and  the  amino 
terminus  of  G2,  similar  to  that  predicted  for  Rift  Vally 
fever  virus,  may  be  present  (Collett  ef  a/.,  1985).  Data 
presented  in  this  paper  demonstrate  that  the  carboxy 
terminus  of  Gl  extends  at  least  to  amino  acid  residues 
588-614  and  thus  limits  the  size  of  the  proposed  in¬ 
tergenic  region  to  less  than  6K.  Within  this  putative 
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Fig.  5.  (A)  Polyacrylamide  gel  electrophoresis  of  polypeptides  immune  precipitated  from  Hamaan-infecied  cell  lysates  with  antipeptide  sera. 
Lane  1  contains  uC-methylated  protein  markers  of  molecular  weights  indicated.  Lane  2  contains  Hantaan  polypeptides  precipitated  with 
polyclonal  anti-Hantaan  sera.  The  more  highly  radiolabeled  nucleocapsid  protein  (NC)  migrates  slightly  faster  than  the  G2  envelope  glycoprotein. 
Lanes  3-6  contain  Hantaan  polypeptides  immune  precipitated  with  the  following  antipeptide  immune  sera:  (3)  preimmune  peptide  588-614, 
(4)  antipeptide  588-614,  (5)  preimmune  peptide  1 127-1 135,  (6)  antipeptide  1127-1135.  (B)  Hantaan  virion  proteins  electrophoretically  transferred 
to  nitrocellulose  membranes  and  probed  with  immune  sera.  Numbers  to  the  left  of  lane  1  correspond  to  migration  of  prestained  molecular 
weight  markers  (Bethesda  Research  Laboratories)  and  are  not  visible  by  autoradiography.  Lanes  2-4  were  reacted  with  (2)  polyclonal,  anti- 
Hantaan  rabbit  sera,  (3)  antipeptide  1 127-1 135  (rabbit  A),  (4)  antipeptide  1 127-1135  (rabbit  B).  (C)  Lane  1  contains  Hantaan  viral  proteins 
radiolabeled  with  [35S)methionme.  Lanes  2-8  are  immunoblots  of  virion  proteins  reacted  with  (2)  normal  mouse  serum;  (3)  preimmune  peptide 
588-614;  (4)  antipeptide  588-614  (rabbit  A);  (5)  antipeptide  588-614  (rabbit  B);  (6)  mixture  of  five  monoclonal  antibody  mouse  ascitic  fluids 
specific  for  Hantaan  G1  or  G2;  (7)  polyclonal  anti-Hantaan  rabbit  sera;  (8)  hyperimmune  anti-Hantaan  mouse  ascitic  fluid. 


intergenic  region  are  amino  acids  which  also  conform 
to  many  of  the  parameters  defining  a  signal  sequence. 
In  addition,  amino  acids  627-645  have  a  calculated 
average  hydropathy  value  of  1.82  (Kyte  and  Doolittle, 
1 982),  which  is  consistent  with  a  membrane-spanning 
region.  A  similar  6K  membrane-spanning  intergenic  re¬ 
gion,  which  is  thought  to  serve  as  a  signal  sequence, 
has  been  demonstrated  between  the  two  envelope 
protein  coding  sequences  of  the  alphaviruses  (Garoff 
era/.,  1980;  Strauss  and  Strauss,  1986). 

The  carboxy  terminus  of  G2  was  readily  defined, 
since  synthetic  peptides  derived  from  the  last  coding 
sequences  of  the  ORF  were  found  to  elicit  anti-G2  an¬ 
tibodies  in  rabbits.  The  amino  acids  adjacent  to  the 
carboxy  terminus  of  G2  (1 107-1127)  were  found  to  be 
extremely  hydrophobic,  with  an  average  hydropathy 
charge  of  3.32,  and  were  followed  by  a  stretch  of  hy¬ 
drophilic  residues,  suggesting  the  presence  of  a  mem¬ 
brane-anchoring  protein  region.  The  precise  location 
of  membrane-associated  regions  of  G1  and  G2  remains 
to  be  determined. 

Bunyaviridae  generally  mature  intracellularly  in  the 


vicinity  of  the  Golgi  and  infected  cells  do  not  usually 
express  viral  antigens  on  their  plasma  membranes  (for 
a  review  see  Bishop,  1985).  Therefore,  neutralizing  an¬ 
tibodies  which  react  with  G1  and/or  G2  may  play  a 
predominant  role  in  recovery  and  immunity.  A  clear  un¬ 
derstanding  of  the  mechanism  of  viral  neutralization 
becomes  especially  relevant  for  the  Hantaviruses  be¬ 
cause  these  viruses  are  unique  among  the  Bunyaviridae 
in  their  ability  to  cause  natural  persistent  infections  of 
their  rodent  hosts  and  to  be  exclusively  transmitted  by 
aerosols  (rather  than  by  any  known  arthropod  vector). 
For  these  reasons,  it  is  important  to  determine  the  role 
of  the  envelope  proteins  in  Hantaan  viral  infection  and 
immunity.  Presently,  little  is  known  about  the  function 
of  G1  and  G2,  except  that  a  monoclonal  antibody  to 
G1  can  neutralize  viral  infectivity  (unpublished  data). 
We  have  examined  the  molecular  properties  of  the 
genes  encoding  G1  and  G2  and  are  continuing  efforts 
to  identify  the  functional  characteristics  of  these  pro¬ 
teins.  These  data  should  provide  a  basis  for  studies 
defining  the  molecular  and  antigenic  nature  of  Hantaan 
viral  infection  and  pathogenesis. 
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